Web - based Supplementary Materials for “ More powerful genetic association testing via a new statistical framework for integrative genomics ”
نویسندگان
چکیده
In Section 3 we showed that our method can have more power to detect o-eSNPs. Here we discuss its power to detect SNPs whose functional mechanisms have non-regulatory components. For simplicity we again consider only continuous Yi, Gi, and a single SNP Si in the ordinary linear model, where the variables have all been centered. We now consider the outcome model Yi = G T i αG + αSSi + i1, where the direct effect αS is nonzero. The transcript model remains Gi αG = βSSi + i2. We again compare to the usual approach of fitting Yi = β ∗ SSi +N(0, σ ∗2). We are interested in comparing the power of tests based on β̂S and β̂ ∗ S under this direct effect model, so we study βS/var (β̂S) and β ∗ S/var (β̂ ∗ S). First, we still have var (β̂ ∗ S) = (σ 2 1 + σ 2)/var (Si). To calculate var (β̂S) note that Step 1 of our integrative procedure is equivalent to marginalizing over Si in the true outcome model, which gives E(Yi | Gi) = Gi αG + αSE(Si | Gi) and var (Yi | Gi) = α Svar (Si | Gi) + σ 1. Without knowing more about the distribution of Si given Gi it is hard to draw further conclusions. But when Si is weakly correlated with Gi, the outcome model is approximately correctly specified and var (β̂S) ≈ σ 2/var (Si) + {σ 1 + α Svar (Si | Gi)}ΣSGΣ GGΣGS/var (Si). (1) Second, β∗ S = αS + βS. Thus when αS is large relative to βS with the same sign, standard analysis will be more powerful. Otherwise, denoting c = ΣSGΣ −1 GGΣGS/var (Si), var (β̂S) < var (β̂∗ S) if α 2 S < σ 2 1(1 − c)/var (Si | Gi)c. Since c is small when Si and Gi are weakly correlated and σ 1 tends to be large, our approach will still have more power unless αS is very large, though when it is so large that αSE(Si | Gi) is not close to zero the outcome model will not be approximately correctly specified.
منابع مشابه
More powerful genetic association testing via a new statistical framework for integrative genomics.
Integrative genomics offers a promising approach to more powerful genetic association studies. The hope is that combining outcome and genotype data with other types of genomic information can lead to more powerful SNP detection. We present a new association test based on a statistical model that explicitly assumes that genetic variations affect the outcome through perturbing gene expression lev...
متن کاملA retrospective likelihood approach for efficient integration of multiple omics factors in case-control association studies.
Integrative omics, the joint analysis of outcome and multiple types of omics data, such as genomics, epigenomics, and transcriptomics data, constitute a promising approach for powerful and biologically relevant association studies. These studies often employ a case-control design, and often include nonomics covariates, such as age and gender, that may modify the underlying omics risk factors. A...
متن کاملEpiviz: Integrative Visual Analysis Software for Genomics
Title of Dissertation: Epiviz: INTEGRATIVE VISUAL ANALYSIS SOFTWARE FOR GENOMICS Florin Chelaru, Doctor of Philosophy, 2015 Directed By: Professor Héctor Corrada Bravo Department of Computer Science Computational and visual data analysis for genomics has traditionally involved a combination of tools and resources, of which the most ubiquitous consist of genome browsers, focused mainly on integr...
متن کاملRegularization Methods for High-Dimensional Instrumental Variables Regression With an Application to Genetical Genomics.
In genetical genomics studies, it is important to jointly analyze gene expression data and genetic variants in exploring their associations with complex traits, where the dimensionality of gene expressions and genetic variants can both be much larger than the sample size. Motivated by such modern applications, we consider the problem of variable selection and estimation in high-dimensional spar...
متن کاملPharmacogenomic approach in type 2 diabetes treatment
Introduction: Type 2 diabetes (T2D) is chronic health caused by the interaction between genetic and environmental factors that results in high blood glucose. The evidence-based guidelines for diabetes management are mainly based on lifestyle changes, control of risk factors, and the management of blood glucose levels. Although numerous antidiabetic agents have been developed over time, T2D trea...
متن کامل